CDS
Accession Number | TCMCG034C16207 |
gbkey | CDS |
Protein Id | XP_008376559.2 |
Location | complement(join(33584868..33584931,33585030..33585355,33585473..33585946,33586145..33586294,33586929..33587000,33587103..33587363,33587466..33587597,33587698..33587949,33588121..33588372,33588461..33588559,33589468..33589632,33589735..33589926,33590163..33591152)) |
Gene | LOC103439741 |
GeneID | 103439741 |
Organism | Malus domestica |
Protein
Length | 1142aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA534520 |
db_source | XM_008378337.3 |
Definition | DNA mismatch repair protein MSH3 isoform X1 [Malus domestica] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGGAAAGCAAAAGCAGCAAGTTATTTCCCGCTTCTTCGCTCCCAAACCCAAAACCCCAGACCCCTCATCTCCTTCCTCCTCCTCATTTCCTCCCTCATCGCCGCCGTCAAACCCTAAAGACCCACCCACCCCACCTCCCAAAATCACAGCCACAGTCGCCTTCTCTCCCGCCAAACGCGCCCTCATCTCCTCCCACCTCGCCGTCTCTTCCTCTCCCAAACCAGCCAAACTCCCCAAACTCTCTCCTCACACCCACAACCCCATACCCGCCGCAGCCAATCCCTCCCTCCACCAAAAATTCCTCCAGAAGCTTCTAGAACCCGCCTCCGACGTTCCAGAACCTCCCCCTTCCTCCAACCAACCCGCCAAGTTCACGCCTTTGGAGCAGCAGGTGGTGGAATTGAAGAAGCGCCACCCGGATGTTCTTCTGATGGTGGAAGTCGGTTACAAGTACCGGTTTTTCGGCGAAGACGCCGAAATTGCTGCGAGGGTTTTGGGGATTTATGCCCACATGGATCACAATTTCTTGACCGCAAGTGTCCCAACCTTCCGGTTGAATGTCCACGTCAGGAGGCTGGTGGGTGCCGGGTATAAGGTCGGCGTGGTGAAGCAGACCGAAACCGCCGCCATTAAGGCACATGGGTCGAACCGAGCCGGTCCATTTGACCGTGGATTGTCGGCATTGTACACGAAGGCGACTCTGGAGGCGGCGGAGGATGTGGGGGGAAAAGAGGAGGGCTGTGGTGGGGATAGTAATTATTTGGCTTGTGTTGTGGACAAGAGCGTTTCAGTGGAAAATGTGGATGGTGGAGTTGAGAGCGGTGTGGAAGTGAAAATTGGAATTGTGGCAGTGGAGGTTTCGACGGGCGACGTTGTCTATGGAGAGTTTGATGATAATTTTATGAGGAGTGGGCTTGAGGCTGTGGTTTTGAGCTTGTCACCTGCTGAGTTGCTTCTTGGGGAGCCACTCTCCAAACAAACAGATAAGATGTTACTTGCTTTTTCTGGACCGGCTTCAAATGTCCGCGTGGAGCGTGTCTCACGAGATTGCTTCAAGGACAGTGGTGCTTTCGCTGAAGTAATGTCTTTATATGAAAACATGGAGGGTGGTGATTTAACAGATCATTCAAAGGTAAATACAGATGTGAAAGAACAGAGTAATAAGCACTTGGGAATTGAGGGATTCATGAACATGCCAAATATGGCAGTCCAAGCATTGGCCCTGACTATTTATCATCTGAAACAATTTGGTTTGCAAAGGATCCTGCGCCTAGGAGCTTCTTTTAGGCCCCTCTCAAGCAGCATGGAGATGACTCTCTCGGCCAATGCACTTCAGCAATTGGAGGTATTGAAGAACAATGCTGATGGATCCGAGTCTGGCTCCTTGCTGCAGTCTATGAATCATACTCTTACCATATTTGGTTCAAGGCTTCTTAGACACTGGGTATCTCACCCTTTATGTGATCGAACCATGATTTCTGCTCGTCTAGATGCTGTTTCTGAGATTGCAGAATCTATGGGGTCTTCTATATCTCCTCACAATATTGAACAGCTAGATGTGGAAGATTCGTTTGCAACAAATGTGAACCCAGAGCTGACTTATATACTTTCTTCAGTTCTGACGACTTTGGGACGGTCGCCTGATATTCAACGTGGGATAACAAGAATCTTTCATCGGACTGCCACTCCACCTGAGTTCATTGCAGTTATTCAAGCTATTTTATATGCTGGAAAACAACTTCAACAACTTCAAATTAAAGAAGAGGGAAGCAAAGGAAATACGAGGGGAAAAAGTGTACGCTCTGAGTTGTTGAGGAAGTTGATATTGACTGCTTCGTCGTCCACTGTAGTTGGAAAAGCTGCAAAATTGTTGTCTGCCCTCAACAAGGAAGCAGCTGACAAACAGGATCTACTAAACCTAATCATCTCTGATGGCCAATTTCCTGAGGTTGCTAAAGCAAGGAAGGAGGTTCAATTGGCAAATGAGAAATTGGATTCTCTCATCAGTTTGTACCGGAAACAGCTTGGAATGCGCAAGTTGGAATTCCTTAGCGTGTCTGGAACAACACACTTGATAGAGTTGACCTTAGATGTAAAGGTGCCCTCAAATTGGGTTAAGATTAATAGTACCAAAAAGACAGTCCGGTATCACCCACCTGAAGTCCTGACTGCTTTAGACCATCTAGCTCTTGCAAGTGAGCAACTCACTGTTGTTTGTCGTGCTGCTTGGGATAGCTTTCTAAGTGGTTTTGGTAAATATTATGCTGAGTTCCAAGCTGCTGTTCAAGCACTAGCCAGTTTAGACTGTCTTCATTCACTTGCTGTCCTTTCAAGAAATAAGAACTATGTTCGTCCAATGTTTGTATATGATGATGAACCGGTCCAGATACACATCTCCTCTGGTCGTCATCCGGTTTTGGAGACTATATTACAAGACAATTTTGTTCCCAATGATACAGATTTGGAGGCAGATGGGGAGTATTGTCAGATTATTACTGGACCCAATATGGGTGGAAAGAGTTGCTACATTCGCCAAGTTGCTCTCATTGCTATCATGGCTCAGGTTGGTTCCTTTGTTCCAGCATCATCGGCAAAACTGCATGTGCTGGACAGCATTTTCACTCGAATGGGTGCTTCTGACAGTATTCACCAAGGGAGAAGCACCTTTCTTGAAGAACTGAGCGAGGCTTCACATATACTTCACAATTGTACATCACGCTCATTGGTTATCATTGATGAGCTTGGGAGAGGCACGAGCACACACGATGGTGTGGCTATTGCTTATGCTACATTAAATCATCTACTACAGCAGAAAAGATGCATGGTCCTATTTGTCACGCACTACCCAAAAATTGCTCATATCAGAACTGAATTCCCAGGCTTAGTGGAGGCGTACCATGTTTCTTATCTGACGTCAAATAGAGATATGGGTTCGACAGGCATTCAATCCGAAAATCAAGATGTCACTTACTTATATAAGCTTGTGCCTGGTGTTTCAGAGAGGAGTTTCGGATTTAAGGTTGCAGAGCTTGCACAGCTACCTTCCTCATGCATCAGACGAGCGACTGTCATGGCTGCTAGGTTGGAGGCAGTAGTAAGCAGCCGAACAAGAAATAAGGATGACAAAAAGTGGTTGCTAAAATCACTGCCAACAGAACAAAAGCAGGAAGAGCAAGATGTGATGCTGGAATCTCCCGAGTGCCTTCGTGTGGGATGGAGCTCGATTTTAGGGGACATAGATGGTGCCCTGTACGAGAAATTCTTTAAGAATTTGAAAACCACATTACTTGATGACAGCGACCCCATAAAAAGCGTTGAGAACTTGAATCACACAAGAAGTATTGCAAGAGAATTAGTAAGCAGATGGCCCTTACCGGAAGTGATCTGTCATGCATACAGCAACAGCAAGCGAAGGGGACGGATTTAG |
Protein: MGKQKQQVISRFFAPKPKTPDPSSPSSSSFPPSSPPSNPKDPPTPPPKITATVAFSPAKRALISSHLAVSSSPKPAKLPKLSPHTHNPIPAAANPSLHQKFLQKLLEPASDVPEPPPSSNQPAKFTPLEQQVVELKKRHPDVLLMVEVGYKYRFFGEDAEIAARVLGIYAHMDHNFLTASVPTFRLNVHVRRLVGAGYKVGVVKQTETAAIKAHGSNRAGPFDRGLSALYTKATLEAAEDVGGKEEGCGGDSNYLACVVDKSVSVENVDGGVESGVEVKIGIVAVEVSTGDVVYGEFDDNFMRSGLEAVVLSLSPAELLLGEPLSKQTDKMLLAFSGPASNVRVERVSRDCFKDSGAFAEVMSLYENMEGGDLTDHSKVNTDVKEQSNKHLGIEGFMNMPNMAVQALALTIYHLKQFGLQRILRLGASFRPLSSSMEMTLSANALQQLEVLKNNADGSESGSLLQSMNHTLTIFGSRLLRHWVSHPLCDRTMISARLDAVSEIAESMGSSISPHNIEQLDVEDSFATNVNPELTYILSSVLTTLGRSPDIQRGITRIFHRTATPPEFIAVIQAILYAGKQLQQLQIKEEGSKGNTRGKSVRSELLRKLILTASSSTVVGKAAKLLSALNKEAADKQDLLNLIISDGQFPEVAKARKEVQLANEKLDSLISLYRKQLGMRKLEFLSVSGTTHLIELTLDVKVPSNWVKINSTKKTVRYHPPEVLTALDHLALASEQLTVVCRAAWDSFLSGFGKYYAEFQAAVQALASLDCLHSLAVLSRNKNYVRPMFVYDDEPVQIHISSGRHPVLETILQDNFVPNDTDLEADGEYCQIITGPNMGGKSCYIRQVALIAIMAQVGSFVPASSAKLHVLDSIFTRMGASDSIHQGRSTFLEELSEASHILHNCTSRSLVIIDELGRGTSTHDGVAIAYATLNHLLQQKRCMVLFVTHYPKIAHIRTEFPGLVEAYHVSYLTSNRDMGSTGIQSENQDVTYLYKLVPGVSERSFGFKVAELAQLPSSCIRRATVMAARLEAVVSSRTRNKDDKKWLLKSLPTEQKQEEQDVMLESPECLRVGWSSILGDIDGALYEKFFKNLKTTLLDDSDPIKSVENLNHTRSIARELVSRWPLPEVICHAYSNSKRRGRI |